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Abstract 

Background: Epigenetic marks superimposed on the DNA sequence of eukaryote chromosomes provide agility 
and plasticity in terms of modulating gene expression, ontology, and response to the environment. Modulating the 
methylation status of cytosine can generate epialleles, which have been detected and characterised at specific loci 
in several plant systems, and have the potential to generate novel and relatively stable phenotypes. There have 
been no systematic attempts to explore and utilise epiallelic variation, and so extend the range of phenotypes 
available for selection in crop improvement. We developed an approach for generating novel epialleles by 
perturbation of the DNA methylation status. 5- Azacytidine (5-AzaC) provides selective targeting of ^^CG, which in 
plants is associated with exonic DNA. Targeted chemical intervention using 5-AzaC has advantages over transgenic 
or mutant modulation of methyltransferases, allowing stochastic generation of epialleles across the genome. 

Results: We demonstrate the potential of stochastic chemically-induced hypomethylation to generate novel and 
valuable variation for crop improvement. Systematic analysis of dose-response to 5-AzaC in B. rapa guided 
generation of a selfed stochastically hypomethylated population, used for forward screening of several agronomic 
traits. Dose-response was sigmoidal for several traits, similar to that observed for chemical mutagens such as EMS. 
We demonstrated transgenerational inheritance of some phenotypes. BraRoAZ is a unique hypomethylated 
population of 1000 E2 sib lines. When compared to untreated controls, 5-Aza C-treated lines exhibited reduced 
immuno-staining of ^^C on pachytene chromosomes, and Methylation Sensitive Amplified Polymorphism (MSAP) 
profiles that were both divergent and more variable. There was coincident phenotypic variation among these lines 
for a range of seed yield and composition traits, including increased seed protein content and decreased oil 
content, as well as decreased erucic acid and corresponding increases in linoleic and/or palmitic acid. Each 
5-AzaC-treated line represents a unique combination of hypomethylated epialleles. 

Conclusions: The approach and populations developed are available for forward and reverse screening of epiallelic 
variation and subsequent functional and inheritance studies. The generation of stochastically hypomethylated 
populations has utility in epiallele discovery for a wide range of crop plants, and has considerable potential as an 
intervention strategy for crop improvement. 
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Background 

Epigenetic marks superimposed on the DNA sequence 
of eukaryote chromosomes have the potential to provide 
agility and plasticity in terms of modulating gene expres- 
sion, ontology, and response to the environment. These 
marks affect chromatin structure, and include cytosine 
methylation of DNA and modifications to histone pro- 
teins. Epialleles have been detected and characterised at 
specific loci in several plant systems, resulting from vari- 
ation at specific sites or in global patterns of DNA 
methylation or as histone variants [1]. Some de novo 
epialleles can confer novel heritable phenotypes over 
several generations [2,3]. An increasing body of evidence 
indicates that many crop agronomic traits are likely to 
be affected to some extent by stably inherited epigenetic 
modifications [3-5]. 

The extent to which inadvertent selection of epiallelic 
variation has contributed to the major increases in crop 
yield or quality achieved through selective breeding over 
the past century remains a mystery. Although epialleles 
have the potential to generate novel and relatively stable 
phenotypes [2,3,6], to date there have been no systematic 
attempts to explore and utilise epiallelic variation, and 
so extend the range of phenotypes available for selection 
in crop improvement. The scope for harnessing epigen- 
etic variation is demonstrated by recent findings of an 
association between DNA methylation and selection for 
energy use efficiency in Brassica oilseed crops [4], as 
well as the interaction between phosphate starvation re- 
sponse in Arabidopsis and the pattern of histone H2A.Z 
epigenetic marks [5]. H2A.Z is excluded from sites of 
heavy DNA methylation in actively transcribed genes 
[7]. Tissue-specific monoallelic de novo DNA methyla- 
tion at the SPll gene within the Brassica rapa pollen 
self-incompatibility locus S has been shown to contrib- 
ute to dominance relationships amongst S-alleles [8]. In 
Arabidopsis, imprinted states mediated by changes in 
DNA methylation and histone chemistry affect vernalisa- 
tion and flowering time [9,10], embryo development 
[11, 12] and seed size [11]. 

DNA methylation is regulated by a set of genes that dir- 
ectly catalyse the methylation process or indirectly influ- 
ence methylation status via chromatin remodelling. In 
mammals and other metazoans this is primarily achieved 
through the activity of DNA METHYLTRANSFERASE 1 
{DNMTl) which encodes the enzyme responsible for ^"^C 
maintenance, and DOMAINS REARRANGE METHY- 
LASE 1 (DRMl) which encodes enzymes responsible for 
de novo methylation at CG sites. Plants differ in the tar- 
gets, distribution and transmission of DNA methylation 
compared with animals. The Arabidopsis DNA METHYL- 
TRANFERASE 1 (METl) is required for catalysing cyto- 
sine methylation in CG context, resulting in ^"^CG [13,14], 
whereas the chromatin remodelling factor DECREASE IN 



DNA METHYLATION 1 (DDMI) is required for main- 
taining DNA methylation both at CG and CHG sites [15]. 

Specific patterns of DNA methylation can be transmit- 
ted through mitosis and cell lineages, as well as transge- 
nerationally through meiosis, mediated by METl 
[16,17]. The stability of epiallelic variation has been 
demonstrated in a number of plant systems, including 
Brassica napus [18]. In Arabidopsis, epiRILs have been 
established with variation and high heritability for flow- 
ering time and plant height, as well as stable inheritance 
of multiple parental DNA methylation variants over at 
least eight generations [19]. Subsequent phenotyping has 
demonstrated heritable variation for many fitness traits 
[20]. Whole genome analysis of DNA methylation in 
Arabidopsis has shown that CG sites tend to be targeted 
within coding sequences [21,22]. More recently, it has 
been found that these marks appear preferentially asso- 
ciated with exons [23]. 

One approach to generating novel epialleles is by per- 
turbation of the DNA methylation status [24,25]. This 
may include ectopic, tissue-specific or induced silencing 
of DNA methyltransferases and other methylation regu- 
lators in the genome. In Arabidopsis, down-regulation of 
METl results in a wide range of abnormal phenotypes, 
including decreased plant stature, smaller rounded 
leaves, decreased fertility and reduced apical dominance 
[6,14]. Although this has provided valuable insights into 
the role of cytosine methylation (^"^C) in gene function 
and development, there are drawbacks to employing 
metl mutants in the context of crop improvement, due 
to the widespread re-patterning of epigenetic marks 
within the genome. However, as with conventional func- 
tional gene analysis, there is potential to reveal a range 
of variants that may underpin epigenetic modulation of 
specific genes or traits. Such an approach would be 
facilitated by exploring a range of epialleles associated 
with specific target genes (reverse epigenetics) or pheno- 
typic traits (forward epigenetics). 

Targeted chemical intervention, using demethylating 
agents such as 5-Azacytidine (5-AzaC), provides an al- 
ternative to transgenic or mutant modulation of METl, 
5-AzaC is a structural analogue of the nucleoside cyti- 
dine with nitrogen atoms in place of carbon in the fifth 
position of the ring structure [26,27]. It is a potent in- 
hibitor of DNA methyltransferases and has been shown 
to be effective in reverting the hypermethylation of 
tumour suppressor genes and suppressing cancer- 
specific cellular phenotypes [26,28-31]. Since 5-AzaC 
interferes with METl, there is a selective targeting of 
^"^CG, which in plants is associated with exonic DNA. 
The use of 5-AzaC has a number of advantages over dir- 
ect down-regulation of METl, including potential for 
stochastic hypomethylation of target sites, rather than 
widespread hypomethylation that may mask more subtle 
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and valuable phenotypes, or result in lethal epialleles. 
This presents the opportunity to screen for specific 
epialleles that may then be stably introgressed into a 
wild type background. Naturally occurring epialleles 
have recently been shown to occur in near-isogenic lines 
of maize and to exhibit relatively stable inheritance [32] . 

Epigenetic intervention through generation of novel 
epialleles may be particularly tractable in the many crop 
plants of polyploid origin, where there is additional scope 
for paralogue- or homoeologue-specific gene regulation. 
The ability to sustain non-lethal variation is evident from 
the higher mutant loads attainable in EMS TILLING popu- 
lations for B. rapa [33], B. napus [34] and wheat [35]. Spe- 
cies within the genus Brassica have been domesticated into 
a remarkably wide range of vegetable, oilseed, fodder and 
condiment crops [36], including turnips, Chinese cabbage, 
pak choi, brocolleto, sarson and turnip rape of B, rapa 
(A genome, n=10) and cabbage, cauliflower, broccoli, 
Brussels sprout of B, oleracea (C genome, n=9). The A gen- 
ome is retained essentially intact within the amphidiploids 
of B, napus (AC genomes, n=19) giving rise to the major 
crops oilseed rape, Canola and swede/rutabaga and mus- 
tard rape/brown mustard of B, juncea (AB genomes, n=18). 
It remains unclear why Brassica species have the capacity 
to generate such a wide range of morphological forms, 
although this has been attributed to gene duplication, 
intergenomic heterozygosity and epigenetic phenomenon 
[37-39]. In the absence of direct evidence from whole gen- 
ome bisulphite sequencing, estimates of ^"^C methylation 
within Brassica have been based on immunoprecipitation, 
giving estimates of 16% of all cytosines within the genome 
[40], and methylation sensitive amplified polymorphism 
(MSAP) giving estimates from 16-57% of target sites [40-42]. 

Here, we demonstrate the potential of stochastic chem- 
ically induced hypomethylation to generate novel and 
valuable variation for crop improvement. We systematic- 
ally optimise the phenotypic dose-response to 5-AzaC in 
B, rapa and demonstrate the transgenerational inheritance 
of some phenotypes. We then use this information to gen- 
erate a stochastically hypomethylated population which 
was selfed and characterised for a number of agronomic 
traits including seed size and composition. We demon- 
strate changes in the distribution of ^"^C in euchromatin 
of pachytene stage chromosomes, variation in MSAP pro- 
files and alterations in the transcriptome. The approach 
and populations developed are available for forward and 
reverse screening of epiallelic variation and subsequent 
functional and inheritance studies. 

Results 

Plants phenotypes have a sigmoidal dose-response 
to 5-AzaC hypomethylation 

Dosage response of the plants was investigated after ex- 
posure to various concentrations of 5-AzaC (water 



control, O.OlmM, O.lOmM, 0.50mM, l.OOmM and 
1.50mM) during seed imbibition by examination of a 
range of phenotypic traits during plant development. 

Germination of El seed was not significantly affected 
(p>0.05) after being imbibed in 5-AzaC at concentra- 
tions up to l.OmM, although there was a small but sig- 
nificant decline (to 98% germination) following exposure 
to 1.5mM (p < 0.05). During the early stages of develop- 
ment, there were clear differences in gross phenotype in 
terms of stature and rate of growth (Figure 1). Plant 
growth was markedly stunted at concentrations of 5- 
AzaC above 0.1 mM (Additional file 1: Figure SI and 
Figure la). In some seedlings that failed to develop true 
leaves, root development was poor and plant stature 
conspicuously reduced. 

There was a significant effect on days to first anthesis 
(Figure Ic and Additional file 1: Figure S2), at 0.1 mM and 
above (p < 0.05). In general, there was an increase in days 
to flowering with increasing concentrations of 5-AzaC. 
The variance in days to flowering also increased with 5- 
AzaC concentration. Reduced pollen productivity and 
increased incidence of defective anthers was observed at 
higher 5-AzaC concentrations, although all plants with 
seemingly viable pollen were able to set at least some 
seeds. However, there was an overall reduction in seed set 
and seed weight associated with increased 5-AzaC con- 
centration (Table 1 and Additional file 1: Figure S3). 

The effect of 5-AzaC exposure also carried forward 
into the seminal generation. There was a small but sig- 
nificant (p < 0.05) reduction in E2 seed germination; 
down to 92% at concentrations of 1.0 mM 5-AzaC and 
above. Compared to plants from control seeds where an- 
thesis occurred at 35 days, there was also a reduction of 
2.8 days (p < 0.01) to first anthesis in a subset of E2 
plants at 0.5 mM 5-AzaC. Some E2 plants also exhibited 
chlorophyll sectoring. 

Careful analysis of two dose-response curves (Figure la 
and lb, relating to abnormal phenotype and survivorship) 
allowed selection of optimal concentrations of 5-AzaC to 
generate a hypomethylated population in B, rapa 
(Figure la, b). There was no significant (p > 0.05) lack of 
fit of the data for either of these curves. This test was 
based on a total of 24 observations for data in Figure la 
(four replicate observations at each of six concentrations), 
and 20 observations for data in Figure lb (four replicates 
at five concentrations). A concentration of 0.23 mM 
represented the concentration at which no more than 50% 
of the plants exhibited a phenotype of reduced stature 
and/or growth rate (Figure la). At a concentration of 
0.32 mM 5-AzaC around 70% of the population are 
expected to survive (Figure lb); this being the concentra- 
tion that reduces survival by 50% of the overall possible 
reduction given an estimated (significant, p < 0.05) lower 
asymptote. Accordingly, we adopted two concentrations. 
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Figure 1 Phenotypic response of B. rapa R-o-18 to 5-AzaC 
treatment at El generation. Lines drawn to the curve indicate tlie 
dose giving 50% reduction in normal plants. The vertical line at 
1.4mM indicates LSD (5%) values for comparison of means. The 
plotted points in (a) and (b) show the means of four replicate 
observations at each concentration, (a) Dosage response curve of 
plant stature 20 days after sowing. Lines drawn to the curve indicate 
the concentration that reduces survival by 50% of the overall 
possible reduction given an estimated (significant, p < 0.05) lower 
asymptote. The vertical line at 1.4mM indicates LSD (5%) values for 
comparison of means, (b) Survival curve 60 days after sowing, 
(c) Frequency distribution of days to first anthesis. Mean and 
variance are indicated in top right of each panel. 



0.25 mM and O.SOmM 5-AzaC for generation of the BraR- 
oAZ population. 

Establishment of the BraRoAZ DNA hypomethylated 
population (El to E3) 

Having identified suitable concentrations of 5-AzaC that 
would generate a population comprising plants that are 
likely to be fertile and yield good seed, and yet also show 
heritable phenotypic effects, we then applied these con- 
centrations to establish a hypomethylated El population 
of B. rapa line R-o-18 (BraRoAZ). Two sub-populations, 
each containing 250 plants, were established (BraR- 
oAZ_02a and BraRoAZ_02b) from seed treatments of 
O.SOmM and 0.25mM 5-AzaC respectively (Figure 2). 

Seed for a total of 1,892 E2 sib-lines were harvested. 
Phenotypic comparison of the El plants with the control 
(untreated) plants (Figure 3) allowed selection of 1,000 
E2 sib-lines (500 per original treatment) (Figure 2) for 
assessment of the E2 generation. These corresponded to 
50 original El plants per treatment, where atypical vari- 
ation (i.e. beyond that seen in the WT controls) had 
been observed in survival rate, flowering time, plant 
height, secondary branching and floral morphology. 

E2 selfed seed were harvested from 473 El lines, to- 
gether with 30 control (untreated parental) lines. For 
each El plant, seed from three separate siliques from the 
middle of the primary inflorescence were harvested and 
stored independently. In addition, seed from the 
remaining siliques for each plant were pooled. One seed 
was used per seed source to generate the E3 generation. 
Thus, there were four unique E2 seed accessions (E2 
sib-lines) per El plant (Figure 2). 

In general there was a lower level of morphological 
phenotypic variation in the E2 plants compared with the 
El plants but still higher than the controls. However, we 
did observe a low frequency of sectoring, reduced plant 
stature, smaller leaves, defective anthers, embryo abor- 
tion, reduced seed set and seed size. 

We next harvested selfed seed from 114 arbitrarily 
selected E2 sib-lines (from the 1,000 grown) correspond- 
ing to 28 original El plants, to generate an E3 
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Table 1 Effect of 5-AzaC treatment on seed set and seed weight 
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Response of B. rapa seed set (number of seeds per silique) and seed weight (mg) to 5-AzaC treatment. Plants were grown from seeds treated with different 
concentrations of 5-AzaC (0.01 mM, 0.1 mM, 0.5mM, I.OmM and 1.5mM) and a control. Seeds were harvested from nominated positions (sixth, seventh and eighth 
positions, from the apex of the primary inflorescence) of randomly selected El plants. A total of 20 plants in each concentration, consisting of four replications 
(five plants per replicate) were used for this analysis. The data were analysed by ANOVA and the table shows the mean, standard error of the mean, variance of 
observations, the standard error of the difference (SED) between means and the degrees of freedom (df). 



population. Seed from one selected silique were har- 
vested and stored independently, with the remaining 
seed pooled. These plants retained a higher variability 
than was seen in the controls, albeit less than the El. 

Variability in patterns of DNA methylation 

MSAP analysis was used to assess the extent to which 
treatment had perturbed global methylation profiles. 
The analysis was based on a sample of 10 seeds from 
five lines each of S2, E2 and E3 and two S3 lines. The 



capacity of 5-AzaC to inhibit methyltransferase activity 
led to the expectation that profiles generated by the 
methylation-sensitive Hpall and Mspl would diverge in 
untreated samples (since methylation will disproportion- 
ally affect the former profiles) but converge in treated 
samples (since the DNA templates lack the methylation 
that cause the difference). As expected, there was 
marked divergence revealed by Principal Coordinate 
Analysis (PCoA) between the Hpall and Mspl profiles in 
all untreated samples (Figure 4). However, there was 



R-o-18 



EO seed 

5-AZAC 
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Figure 2 An overview of the 5-AzaC-treated population BraRoAZ. A total of 500 El plants was raised from (EO) seeds treated with 0.25mM 
or O.SOmM 5-AzaC. From each of 473 El plants, seed were harvested from three siliques and assigned independent accessions. Seed were 
harvested from the remaining siliques and combined. 10 seeds (sib-lines) were sown from each of a subset of 100 El plants, selected based on 
morphological features such as plant stature, branching and floral morphology. From these E2 plants, 114 were randomly selected to obtain E3 
seed. Single siliques were harvested from each plant, and the remaining seed were combined. 
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Figure 3 Morphological variation observed in E1 plants from 

treatment with 5-AzaC at 0.25mlVI and 0.3mM. (a) Plants 

showing a continuum of plant stature and an example of a late 

flowering phenotype. (b) normal anthers (black arrow) and male 

sterile anthers (white arrow) were frequently observed on the same 

plant, (c) Reduced seed size in E3 sib line BraRoAZ_l 1714e3 

(left panel) compared to R-o-18 (right panel). Bar= 1cm. 
J 

considerable variance between the behaviour of the trea- 
ted lines. Four treated lines (BraRoAZ_11543e2, BraR- 
oAZ_12447e2, BraRoAZ_10261e3, BraRoAZ_10542e3) 
from the E2 and E3 generations showed a marked loss 
in discrimination between enzyme profiles relative to the 
untreated controls (Figure 4). Conversely, the remaining 
treated lines retained the wide divergence between the 
Hpall and Mspl profiles, and so were indistinguishable 
from the untreated control lines (Figure 4). The signifi- 
cance of these differences was estimated using PhiPT 
by AMOVA. Here, the same four lines were alone in 
generating sufficiently low PhiPT values to infer non- 
significant divergence between the Hpall and Mspl pro- 
files (0.1642<P>0.0006). All remaining lines produced 
the Hpall and Mspl profiles that were highly signifi- 
cantly divergent (P=0.0001) (Additional file 2: Table S2). 

We assessed the spatial distribution of ^"^C in chromo- 
somes by immuno-labelling in meiotic pachytene 
spreads of control S2 lines, and in four BraRoAZ_E2 
lines. Clusters of ^"^C signal co-localised with hetero- 
chromatic regions both in control plants and in BraR- 
oAZ_E2 plants but were notably reduced within the 
euchromatin of the BraRoAZ_E2 lines (Figure 5). 

Hypomethylation affects seed size and other 
yield components 

We next examined whether agronomically important 
aspects of seed morphology and composition had been 
altered by the treatment. Seed size was inferred indir- 
ectly from the area of seed in profile. Seeds from BraR- 
oAZ E2 lines were on average 0.14mm^ (SE 0.040) 
smaller than the S2 controls (Figure 6a; Wald statistic = 
11.97 on 1 df; p <0.001). Components of variation 
between lines were greater for the BraRoAZ E2 lines 
(variance = 0.125) than for the control R-o-18 S2 sib- 
lines (variance = 0.009, change in deviance compared to 
common variance = 40.4 on 1 df, P< 0.001). Similarly, 
variation between seeds within lines was greater for the 
BraRoAZ E2 lines (variance = 0.122) than for the control 
R-o-18 S2 sib-lines (variance = 0.083, change in deviance 
compared to common variance = 40.4 on 1 df, P<0.001), 
and a subset of BraRoAZ E2 lines exhibited a much 
greater range of seed size. BraRoAZ E3 lines also had a 
greater range of seed size compared with the S3 controls 
(Figure 5b). We found no evidence of correlation 
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Figure 4 Effect of 5-Azac on epigenetic instability. Principal coordinate analysis (PCoA) based on the Euclidean analysis of 
methylation-sensitive amplified polymorphism (MSAP) assays using primer combinations H2/E1 and H3/E3. Plant lines are colour coded as shown 
in key, with square symbols {HpaW) and circles (Mspl). a) Main graphic, PCoA showing mean epigenetic distances for each of the groups; 
b) Nested graphic, epigenetic distances for all the samples in lines BraRoAZ_12447e2 and BraRoAZ_12845s2. Euclidian space within circles 
represent intra-population variation (i.e. SSWP) for control line BraRoAZ_12845s2 using HpaW (dashed) and Mspl (dotted). Arrow indicates the 
epigenetic distance between BraRoAZ_12845e2 samples restricted with HpaW or Mspl (PhiPT). 



between seed size and seed number per silique in indi- 
vidual sib-lines and control lines. 

Seed yield within the BraRoAZ_E2 and BraRoAZ_E3 
populations was considerably more variable than in the 
corresponding S2 and S3 lines (Figure 6c, d). We ana- 
lysed a subset of 100 E2 lines and found that seven of 
the top 10 yielding lines were scored with profuse 
branching (increase in number of axillary branches sub- 
tending main stem) compared with nine of the 
remaining 90 lines. Seed yield per plant showed a strong 
correlation (r^=0.94) with the number of seeds per plant, 
which may be a direct consequence of the increased 
branching and/or increased fertility. However, the 
observed variation in seed size was poorly correlated 
with seed yield (r=-0.12), as well as with number of seed 
per plant (r=-0.32). This suggests that the effects of 
treatment on seed size are probably independent from 
those on other yield components. 



Seed composition 

The BraRoAZ E2 sib-lines yielded seed with a lower 
total oil content (Figure 6e; E2=43.1%, S2=48.9%, Wald 
statistic = 5261.3 on 1 df, P<0.001) and higher total pro- 
tein content (Figure 6f; E2=27.2%, S2=23.4%, Wald stat- 
istic = 4475.4 on 1 df, P<0.001) than the S2 controls. For 
both components there was a similar range of variation 
within the BraRoAZ_E3 population. 

There was increased variation in proportion of individ- 
ual fatty acids in the BraRoAZ E2 population compared 
with the controls (Figure 6g,l,j). Within the BraRoAZ E3 
population we observed a similar, but more extenuated 
variability, although here, increased spread was also 
observed in the S3 control lines, perhaps indicating 
greater GxE interactions (Figure 6h). In general, there 
was an increase in palmitic (E3=3.24%, S3=3.15%, Wald 
statistic = 60.2 on 1 df, P<0.001) and linoleic 
(E3= 19.45%, S3= 18.82%, Wald statistic = 88.2 on 1 df. 
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Figure 5 Immunodetection of ^'"C in (a) wild type S2 and (b) E2 
pachytene chromosome spreads from meiocytes. 4',6-Diamidino- 
2-Phenylindole (DAPI) staining indicated in blue, and anti-5mC 
detected using Alexa-488 conjugated secondary antibody indicated 
in green. E2 (b) sliows reduced ^"^C signal in non-heterochromatic 
regions compared with S2 (a). 



P<0.001) acids, corresponding to a decrease in erucic 
acid (£3=54.19%, S3=57.56%, Wald statistic = 629.9 on 1 
df, P<0.001) in most treated lines. Examination of the 
pairwise relationships between line means for individual 
fatty acids enabled us to infer that hypomethylation may 
have targeted specific steps in fatty acid elongation and/ 
or side-chain modification [43]. For example, the nega- 
tive correlation (r= -0.88) of erucic acid : linoleic acid is 
consistent [43] with up-regulation of FAD2, Further- 
more, there was a positive correlation (r=0.75) between 
palmitic acid and linoleic acid, and a negative correlation 
(r=-0.74) between palmitic acid and erucic acid. 

Stochastic hypomethylation and associated 
transcriptional changes in B, rapa 

Finally, we sought evidence for specific transcriptional 
changes associated with 5-AzaC-induced hypomethyla- 
tion in a representative treated B. rapa line. In these 
experiments, leaf RNA from two wild-type control lines 
and a single E3 line (BraRoAZ_12445e3, selected based 



on a phenotype of reduced seed size compared with S3 
seed) was hybridised against the Affymetrix GeneChip® 
Brassica Exon 1.0 ST Array, which is based on the 135k 
unigene set [44]. At a threshold of 1.2- fold change in 
transcription level, 10% of genes appeared to show a 
change in transcript level. At a higher threshold of 2- 
fold change, 3.7% of genes showed a change in transcript 
level, with 216 genes up-regulated in the BraR- 
oAZ_12445e3 plant compared with the R-o-18 wild-type 
control (Additional file 3: Table S3). Preliminary analysis 
of GO terms indicated that 79 genes classified as 
involved in the cellular component were upregulated in 
the BraRoAZ plant. Overall, a small number of genes 
were significantly up-regulated in this BraRoAZ plant 
(two fold threshold), while none appeared to be signifi- 
cantly down-regulated. 

Discussion 

We have performed a systematic dose-response charac- 
terisation of the effect of DNA hypomethylation induced 
by 5-AzaC in B. rapa. Based on this analysis, we gener- 
ated a unique hypomethylated population of 1,000 E2 
sib-lines with a stochastic distribution of reduced ^"^CG 
marks. Each line represents a unique combination of 
hypomethylated epialleles that are available for forward 
and reverse genetic screening. We carried out forward 
screening and investigated the range of variation for seed 
size and other yield components, as well as seed com- 
position. These epi-TILLING populations are analogous 
to those generated using conventional nucleotide muta- 
gens such as EMS [33,45]. However, they do offer the 
distinct opportunity to explore and exploit subtle inter- 
actions with phenotype that may be associated with 
epiallelic variation. 

Before adopting any treatment that seeks to reveal 
new epigenetic variation through interference in en- 
dogenous methylation control systems, it is important 
first to demonstrate efficacy of treatment and to opti- 
mise its application. 5-AzaC is a structural analogue of 
the nucleoside cytidine [26,27] that inhibits the action of 
the DNA methyl transferase METl, which in plants 
maintains methylation in exonic ^"^CG contexts. How- 
ever, there is scope for ineffective application should the 
5-AzaC fail to access the target tissues, or should the 
time of exposure be insufficient to elicit a response. 
Conversely, excessive application could lead to toxicity 
and/or lethal levels of mutation or demethylation [46]. It 
is therefore important to develop appropriate strategies 
to evaluate whether exposure levels were sufficient to 
ensure efficacy of demethylation whilst also minimising 
potentially damaging effects. We used frequency of sur- 
vivorship and the absence of gross phenotypic abnormal- 
ity to establish that the level of 5-AzaC applied was not 
excessive. The dose-response to 5-AzaC was sigmoidal 
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(See figure on previous page.) 

Figure 6 Boxplots displaying difference in seed size and composition traits recorded in the BraRoAZ E2 and E3 populations compared 
to control S2 or S3 populations, a) and b) Seed area recorded in tine E2 population compared to S2 control and E3 population compared to 
S3 controls, respectively, c) and d) show total seed weight per plant for the E2 and E3 populations, e) total oil content as a percentage of dry 
weight recorded in E2 compared to S2. f) total protein as a percentage of dry weight recorded in E2 compared to S2. g) and h) erucic acid as a 
% of the total seed oil measured by NIRS for E2 and E3 with respective controls, i) linoleic acid shows an increase in E2 compared to S2 and 
j) palmitic acid as a % of total oil content in E2 compared with S2 control. 

V J 



for several traits recorded, similar to that observed for 
chemical mutagens such as EMS. This suggests that the 
B. rapa genome is able to tolerate low concentrations up 
to -O.lmM 5-AzaC for the treatment period of three 
days during which mitoses are occurring in the meri- 
stematic cells. Above this concentration, an exponential 
response is observed up to -l.OmM 5-AzaC, above 
which there were no apparent additional lethal effects. 
Moreover, there were only minor cytotoxic effects 
observed within hypocotyls and cotyledon tissues in 
seedlings. In general, the El plants completed their life 
cycle at all concentrations, and successfully set seed. 

The potency of the 5-AzaC dose-response is likely to 
vary, depending on the dinucleotide composition and 
structural organisation of different genomes [29]. Com- 
pared with EMS in the same R-o-18 genotype [33], 5- 
AzaC gave rise to a greater proportion of plants with a 
normal visual phenotype, although the effects on fertility 
were similar. 

It is important next to verify that the 5 -AzaC treat- 
ment apparently leading to these effects is doing so by 
causing hypomethylation of the genome. We examined 
this in two ways. First, immune-localisation of methy- 
lated cytosines revealed essentially similar distributions 
of FISH signal across the euchromatic regions of pachy- 
tene chromosomes in treated (E2) and control plants 
but with a marked reduction of signal in treated lines. 
This is consistent with the expected genome-wide hypo- 
methylation associated with 5 -AzaC treatment and con- 
gruent with previous observations in Triticale [47]. 
Second, we compared the level of disparity between the 
MSAP product profiles generated by the isoschizomers 
Hpall (methylation sensitive) and MspL As expected, 
untreated control lines exhibited wide divergence be- 
tween the profiles generated by the two enzymes, largely 
because of their differential sensitivity to the presence of 
cytosine methylation. However, it was notable that this 
distinction had been lost in four of the treated lines 
sampled, but had been retained in others. The most 
plausible explanation of this finding is that hypomethyla- 
tion triggered by the 5-AzaC treatment had been sto- 
chastically effective and was only apparent in a subset of 
treated plants. This finding is consistent with the similar 
frequency of phenotypic abnormality seen in the E2 and 
E3 plants. Given the stochastic nature of 5-AzaC effi- 
cacy, this relatively simple approach has direct and 



valuable utility in allowing early identification of El lines 
where the treatment has actually affected the intended 
hypomethylation. This capacity would greatly improve 
the efficiency of any high-throughput programme seek- 
ing to use the chemical to generate new levels of pheno- 
typic variation, or otherwise to manipulate the 
methylation status for genetic or epigenetic studies of 
gene function. 

Having generated variation in methylation status, it is 
important next to determine the extent of associated 
phenotypic variation among the treated plants. In the 
present study, we certainly found additional phenotypic 
variation at the El generation when compared to un- 
treated controls, as reported previously in a range of 
plant species [10,48,49]. However, compared with B. 
oleracea [50], 5-AzaC appeared to give rise to a reduced 
range of morphological phenotypic effects in B, rapa R- 
0-18. This may be partly attributed to the smaller gen- 
ome size of the Brassica A genome [51] and/or different 
chromatin organisation, particularly in terms of relative 
amounts of heterochromatin and/or distribution of 
transposable elements [52]. Using a cytological ap- 
proach, Braszewska-Zalewska [53] showed that the B. 
rapa genome has a distinct distribution of DNA methy- 
lation, primarily located in the heterochromatin when 
compared with B. oleracea. 

There was also evidence that novel variation was 
present in several traits of agronomic importance. For 
example, flowering time is a complex multigenic trait 
that is regulated through the interaction of different sig- 
nal transduction pathways [54-56]. Our finding that 
flowering time in the El generation is delayed following 
exposure to > O.lmM 5-AzaC is consistent with methy- 
lation playing some role in the control of flowering time 
in B, rapa. The clone of B. rapa used (R-o-18) is 
unresponsive to vernalisation, in common with the 
Arabidopsis ecotype Landsberg erecta [57]. This pheno- 
type in Arabidopsis has been attributed to reactivation 
of FWA, a repressor of flowering which is constitutively 
methylated in its promoter and therefore repressed in 
the wild type at most stages of the life cycle. FWA is 
maternally imprinted [58] and is dependent on DNA 
methylation for its imprinted state [59,60]. However, we 
also observed a reduction in the time to flowering in a 
subset of E2 plants exposed to 0.5 mM 5-AzaC, a find- 
ing that is congruent with several previous reports 
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[10,48,50,61] Subsequent systematic screening of flow- 
ering time within the BraRoAZ E2 population treated 
with 0.25 mM 5-AzaC also indicated a reduced time to 
flowering. Curiously, in some cases early flowering phe- 
notypes were observed in the E2 generation from El 
lines had been late flowering, a phenomenon that has 
also been found in E2 and E3 populations of B. oleracea 
var. italica (King, unpublished data). This may be due 
to segregation of alleles in the E2 generation that affect 
different components of the flowering pathway and is 
perhaps indicative of the high level of redundancy and 
interdependency in the epigenetic control of this im- 
portant trait. 

The increased variability generated by stochastic hypo- 
methylation of the genome has most value if this vari- 
ation can be fixed or at least stabilized. Previous studies 
have demonstrated that this can be achieved in Arabi- 
dopsis through self-pollination [19] or doubled haploid 
production (King, unpublished). The dose-response 
analysis informed our selection of optimal concentration 
of 5-AzaC to use for generation of the BraRoAZ popula- 
tion, where E2 seed was set from 94.5% of El plants. 
The establishment of this population is analogous to an 
EMS -TILLING population and is available to be used 
for both forward and reverse epi-genetic screening. As 
expected, we observed a reduced range of phenotypes 
compared with EMS mutagenesis in the same R-o-18 
genotype [33]. 5-AzaC only targets a subset of methy- 
lated sites where these exist, whereas EMS induces tran- 
sition mutations which can lead to truncation of 
transcripts and modification of proteins, and is able to 
target all regions of the genome. 

In plants, DNA methylation is at least sometimes 
transmitted directly through meiosis and maintained 
through post-meiotic mitosis giving rise to gametophytes 
[62,63]. In two separate experiments (dose-response and 
BraRoAZ population) we found that certain phenotypes 
were transgenerationally inherited from El to E2 gener- 
ation following 5-AzaC treatment. These included an 
overall reduction in plant stature, small seed size and 
modified floral morphology. The apparent loss of vari- 
ation observed in E2 and E3 generations relative to El 
could arise for a number of reasons, including re- 
methylation of specific cell-lineages forming gametes, 
dominance of wild-type alleles and segregation of epi- 
static loci. The E2 and subsequent generations are 
expected to possess functional METl loci, and hence 
progressive re-methylation of specific CG sites may 
occur. The ability to distinguish between novel epi- 
alleles and collateral genetic variation, such as may result 
from activation of transposons, requires more detailed 
segregation and molecular analyses. 

The range of morphological variation we observed was 
comparable to that observed following down-regulation 



of genes involved in maintenance DNA methylation 
[6,14,64]. A key attribute of the hypomethylated popula- 
tion we have generated is that there has only been a 
transitory exposure to 5-AzaC, that inhibits METl activ- 
ity for the small number of cell divisions that occurred 
over three days during seed imbibition and meristem ac- 
tivation. In contrast, constitutive down regulation of 
METl (in mutants or by RNAi) affects a range of pheno- 
types including plant stature, leaves, apical dominance, 
flowering and fertility [6,14,65]. From preliminary ana- 
lysis of EMS mutation of BraA.METl.a we have evi- 
dence of effects on plant stature, leaf shape and fertility 
(data not shown). A similar range of abnormal pheno- 
types is observed in Arabidopsis ddml mutants [64,66]. 
Although developmental abnormalities were not initially 
reported in B. rapa following RNAi down-regulation of 
BraA.DDMl [67], we have observed some morphological 
abnormalities in subsequent generations. 

The structure of the BraRoAZ population provides a 
means for systematic screening of epiallelic variation. 
The distribution of ^"^CG marks retained in the E2 and 
subsequent generations is expected to be stochastic, 
leading to local variations in epiallelic status of adjacent 
genes. This offers several advantages over constitutive 
down-regulation of METl. For example, in metl 
mutants of Arabidopsis up to 90% reduction in cytosine 
methylation can occur, predominantly in CpG dinucleo- 
tides [14]. Availability of epiRILs in Arabidopsis gener- 
ated from a cross between wild type and the ddml 
mutant offers an alternative strategy for characterising 
epigenetic effects at target loci [19]. However, this ap- 
proach may not be so effective in more complex gen- 
omes such as Brassica, which contain multiple 
paralogous genie regions, and a larger load of transpos- 
able elements. 

We observed variation in a range of seed yield and 
composition traits, within both the E2 and E3 genera- 
tions. Epigenetic processes have been implicated in the 
regulation of seed size, with DNA methylation appearing 
to play a key role. For example, crosses between a hypo- 
methylated maternal genome and wild type paternal 
genome lead to increased seed size, whereas the recipro- 
cal crosses lead to reduced seed size [11,68,69]. For some 
lines we observed inheritance of the reduced seed size 
phenotype from E2 to E3 generations. We were able to 
demonstrate that the variation in seed size is largely in- 
dependent of seed number and seed yield per plant, sug- 
gesting that there is opportunity to select and recombine 
specific epialleles to increase overall yield. 

We found that hypomethylation had an effect across 
the population (E2 and E3) of increasing seed protein 
content, with a corresponding decrease in oil content. 
Given our null hypothesis was random variation around 
the wild-type mean for both components, this indicates 
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that hypomethylation could lead to de-repression of spe- 
cific genes that modulate the oibprotein ratio, either via 
specific resource allocation pathways, or through inter- 
action with the normal seed development and matur- 
ation programme. 

In addition, we have demonstrated that hypomethyla- 
tion can give rise to large modulations in the proportion 
of key seed fatty acid components, and that these are 
transmitted through at least one meiotic event. These 
effects include decreases in erucic acid that correspond 
to increases in linoleic and/or palmitic acid. This level of 
variation is consistent with that observed in surveys of 
natural genetic variation and population segregation 
within Brassica species [43]. The inference of which 
steps in the accepted fatty acid synthesis pathways are 
affected allows us to postulate which sets of genes may 
have been up-regulated as a result of hypomethylation. 
This either may be due to direct effects on a specific en- 
zyme or as a result of de-repression of an activator of a 
specific enzyme. For example, the reduction in the levels 
of erucic acid and corresponding increase in levels of 
linoleic acid is consistent with activation of FAD2 [43]. 
The increases in palmitic acid with concomitant increase 
in linoleic may indicate an activation of a gene earlier in 
the fatty acid elongation pathways. 

There was no consistent pattern of inheritance at a 
population level between the E2 and E3 for seed com- 
position traits. However, this is as expected for relatively 
complex traits in bulked sib-seed, where there may be 
strong GxE interactions compounded with segregation 
of one or more affected loci. Moreover, greater variation 
was observed in the S3 controls compared with S2, 
underlying the importance of studying epiallelic vari- 
ation in well controlled and reproducible environments. 
Variation in ^"^CG marks have been implicated in modu- 
lating GxE interactions in the vernalisation response of 
PLC and MAF-S [70]. DNA methylation marks also tend 
to exclude histone H2A.Z [7], which appears to play an 
essential role in temperature perception [71]. As with 
EMS TILLING populations the stochastic distribution of 
(epi-) mutant alleles in early generations is likely to mask 
the phenotypic effect of single loci. Thus it is necessary 
to generate backcross lines (to wild type) through at 
least three generations prior to carrying out a rigorous 
segregation analysis and identifying the relevant loci. 

Preliminary transcript analysis of leaf tissue from indi- 
vidual E3 plants did not show any evidence of widespread 
variation in gene expression, with only a small number of 
genes being up-regulated and none down- regulated. This 
is consistent with the observed phenotypic observations 
and de-repression of transcription expected, given the 
stochastic distribution of modified 5 mC sites that will 
have arisen from the original 5-AzaC treatment. Under 
controlled growth conditions on two distinct occasions. 



more variation was observed in transcript profile between 
homozygous wild type controls than was observed 
between two sib E3 plants grown on single occasions. 
More comprehensive screening of lines is required to 
understand the distribution and pattern of epiallelic 
variation affecting transcript levels within the population, 
and this may be amenable to targeted reverse epiallelic 
assays. 

Conclusions 

Generating stochastically hypomethylated populations 
may assist epiallele discovery across a wide range of crop 
plants, and has considerable potential as an intervention 
strategy for crop improvement [3]. This will require de- 
velopment of strategies that ensure stable retention and 
deployment of desirable epialleles within breeding ma- 
terial or seed multiplication stocks, and also to develop 
new techniques for targeted epigenetic modification. 
These strategies need to take into account issues of re- 
methylation, GxE interactions, as well as epigenomic re- 
modelling resulting from initial hybridisation events 
[72]. We have previously suggested [3] the need for 
rapid isolation of chromosomal segments carrying a 
modified epiallele through introgression. Targeted re- 
verse epi-genetic screening also requires detailed know- 
ledge of the distribution of ^"^C in crop genomes using 
high throughput bisulphite sequencing [21], together 
with locus-specific marker assays that detect epialleles 
using technologies, such as high resolution melts [73]. 

Methods 

Plant material and seed treatment with 5-AzaC 

R-o-18 is a rapid cycling self-compatible inbred line of 
B, rapa var. trilocularis (Roxb.) Hanelt (yellow sarson), 
and was originally obtained from John Innes Centre, 
Norwich, UK. Seed from accession number GK070302 
were treated with the demethylating agent 5-AzaCytidine 
(Sigma, UK), essentially as described by King [50]. A 
stock solution of 15 mM 5-AzaC was prepared in 
50 mM 2-[N-morpholino] ethanesulphonic acid (MES) 
buffer at pH 6.3. To establish dose-response, five differ- 
ent concentrations (0.01 mM, 0.1 mM, 0.5 mM, 1.0 mM 
and 1.5 mM) were prepared. 50 seeds were placed evenly 
on filter paper within 9cm Petri dishes to which 4.5 ml 
5-AzaC solution (or water, for controls) had been added, 
placed in a containment tray and incubated in the dark 
at 20°C for three days. The seeds were transferred to 
a sieve, washed with water at least eight times, and 
then sown in Rothamsted prescription mix compost in 
multi-celled seed trays. The trays were arranged in a 
randomised complete block design (RCBD) with four 
replicates of 100 seedlings for each of the six treatments. 
Unless stated otherwise, plants were grown in a glass- 
house with supplementary lighting with 16 hour day 
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length, providing a minimum of 300 (imol/m^/s of 
Photosynthetically Active Radiation. Plants were grown 
at nominal temperatures of 18°C day and 15°C night, 
with an air venting band of 2°C. However, during sum- 
mer 2008 a peak temperature of 32°C was recorded, 
which may have affected development of the E3 popula- 
tion (see next section). Seedlings were transplanted into 
compost in 9cm pots 14 to 17 days after initial potting. 
In order to ensure self-pollination, inflorescences were 
enclosed in micro-perforated pollination bags prior to 
anthesis. 

Establishment of hypomethylated population BraRoAZ 

"E" denotes epiallelic' by analogy with "M" for 'mutant'. 
The nomenclature of seed and plant generations follows 
that used previously for EMS mutant populations [33]. 
Thus El plants represent those growing from seed that 
were imbibed with 5-AzaC. E2 is the generation after El 
and so on (Figure 2). 

Sets of 250 seeds each of accession GK070302 were 
treated with 0.25 mM and 0.30 mM prepared from the 
same batch of stock 5-AzaC. The plants were grown in a 
RCBD which included two control sets, treated with 
water or with 2-[N-morpholino] ethanesulphonic acid 
(MES) buffer. Plants were monitored and phenotypic 
traits recorded during growth. Leaf material was col- 
lected from all 500 plants, freeze-dried and stored as 
archival material for molecular analysis. From each of 
500 El plants three siliques were harvested and assigned 
separate accession numbers (total 1500). Seeds from the 
remaining siliques on each plant were pooled and 
assigned accession numbers, and total seed weight per 
plant recorded. 

A subset of 77 lines from the two treatments was iden- 
tified based on observed abnormal El phenotype, along 
with an additional 23 lines of wild-type appearance. 
These 100 lines were used to generate the E2 generation 
of plants, and were grown with six control (untreated) 
lines. Ten seed from each selected line (generating 1000 
E2 plants, 500 per treatment) were sown in pots and ar- 
bitrarily placed within a glasshouse compartment. 

BraRoAZ_E3 seeds were harvested from specified sili- 
ques from all plants and assigned unique accession num- 
bers, totalling 1,000 lines. El plants which gave rise to 
E2 seeds were grown between October 2007 and March 
2008, while E2 plants which gave rise to E3 seeds were 
grown between May 2008 and November 2008. 

Phenotypic trait measurements 

Germination was assessed by counting the number of 
seeds that had germinated (by emergence of the radicle) 
seven days after imbibition. Plant height was recorded 
20 days after sowing, with a threshold of stunted' 
growth set at 9cm in comparison with a minimum of 



11cm among control plants. Days to first anthesis was 
recorded (from the date of sowing into compost). Plant 
survival was recorded 60 days after transplanting. Num- 
ber of seeds per silique and the average seed weight of 
25 seeds were recorded after harvest. Siliques were taken 
from nominated positions (sixth, seventh and eighth sili- 
ques, counting from the apex of the plant) of the pri- 
mary inflorescence. The presence or absence of pollen 
was recorded. E2 germination was recorded seven 
days after imbibition for four replicates of 30 seed 
per treatment on wetted filter paper in Petri dishes. 
Morphological features including plant stature, floral 
morphology and branching were recorded in both the 
El and E2 generations. 

Seed size was determined using a Marvin (Selecta 
Machinefabriek B.V., Enkhuizen, Netherlands) digital 
seed analyser with no more than 150 seeds being placed 
on a small seed tray. Number of seed and individual seed 
data were recorded using the Marvin 4.0 software, with 
a circularity setting of three. 100 E2 lines were sampled 
with 30 S2 water control lines. 

Non-destructive assessment of seed composition was 
carried out by Near InfraRed spectroscopy (MRS) using 
the NIRFlex N-500 (BUCHI UK Ltd, Oldham, UK) for 
approximately 200-500 seeds per line, with a 15 x 45 
mm glass vial in the NIRFlex Solids Vial add-on. Two 
independent readings for each line were taken on differ- 
ent days. Samples were analysed in a randomised order 
on two separate days using 32 scans for each sample. An 
S3 MES control sample was nominated as a reference 
standard and analysed in alternate runs to enable nor- 
malisation of drift in measurements recorded on separ- 
ate occasions. The % oil and protein is expressed on a 
dry matter basis whereas fatty acids are expressed as a % 
of total oil content. Standard fatty acid methyl esters 
(FAME) analysis was used to compare and validate NIRS 
assignments for specific components. Extraction was 
carried out for Ave seeds per vial, replicated five times for 
each of one wild type and three E2 lines BraR- 
oAZ_12175e2, BraRoAZ_12175e2 and BraRoAZ_12067e2, 
with Heptadecanoic acid (100 mg) added as an internal 
standard before processing [74]. 

MSAP profiling of DNA methylation in BraRoAz 
Plant material 

Ten seeds from two S2 (BraRoAZ_12845s2, BraR- 
oAZ_12857s2), two S3 lines (BraRoAZ_12842s3, BraR- 
oAZ_12854s3), five E2 (BraRoAZ_10263e2, BraRoAZ_ 
10287e2, BraRoAZ_10543e2, BraRoAZ_11543e2, BraR- 
oAZ_12447e2) and five E3 lines (BraRoAZ_10261e3, 
BraRoAZ_10286e3, BraRoAZ_10542e3, BraRoAZ_11541e3, 
BraRoAZ_12445e3) were sown in Rothamsted prescription 
mix compost in multi-celled seed trays and placed under 
controlled growth conditions (18°C day/15°C night, 16 
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hour day length, 300 (imol/m^/s of Photosynthetically 
Active Radiation). The first true leaf from each seedling 
was sampled at an equivalent developmental stage, frozen 
in liquid N2, stored at -80°C. The DNA was extracted 
using Qiagen DNeasy kit and diluted in nuclease free 
water to a final concentration of 10 ng/[il 

MSAP procedure 

We used a modification of the MSAP methods described 
by [75] to reveal global variability in CG methylation 
patterns between B. rapa samples. For each individual, 
50ng of DNA were digested and ligated for 2 h at 37°C 
using 5U of EcoR\ and lU of Msp\ or Hpall (New 
England Biolabs), 0.45 \iM EcoRl adaptor, 4.5 \iM Hpall 
adaptor (see Additional file 2: Table SI for oligonucleo- 
tide sequences) and lU of T4 DNA ligase (Sigma) in 11 
[il total volume of IX T4 DNA ligase buffer (Sigma), 1 
|il of 0.5M NaCl, supplemented with 0.5 [A at Img/ml of 
BSA. Enzymes were then inactivated by heating to 75°C 
for 15 min. Following restriction and adaptor ligation 
there followed two successive rounds of PGR amplifica- 
tion. For preselective amplification, 0.3 [A of the restric- 
tion/ligation products described above were incubated in 
12.5 \i\ volumes containing IX Biomix (Bioline, London, 
UK) with 0.05 [A of Preamp£coi?I primer and 0.25 [A 
PredimpHpall/Mspl (both primers at 10 uM) (see 
Additional file 2: Table SI primers sequences) supple- 
mented with 0.1 \A at Img/ml of BSA. PGR conditions 
were 2 min at 72_G followed by 30 cycles of 94_G for 30 
s, 56_G for 30 s and 72_G for 2 min with a final extension 
step of 10 min at 72°G. Selective PGR reactions were per- 
formed using 0.3 [A of preselective PGR reaction product 
and the same reagents as the preselective reactions but 
using FAM labeled selective primers (E2/H1 or E3/H3; 
see Additional file 2: Table SI for primer sequence). Gyc- 
ling conditions for selective PGR were as follows: 94°G 
for 2 min, 13 cycles of 94°G for 30 s, 65°G (decreasing by 
0.7°G each cycle) for 30 s, and 72°G for 2 min, followed 
by 24 cycles of 94°G for 30 s, 56°G for 30 s, and 72°G for 
2min, ending with 72°G for 10 min. 

Fluorescently labeled MSAP products were diluted 
1:10 in nanopure sterile water and 1 [A was combined 
with 1 [A of ROX/HiDi mix (50 [A ROX plus 1 ml 
of HiDi formamide. Applied Biosystems, USA). Sam- 
ples were heat-denatured at 95°G for 3-5 min and 
snap-cooled on ice for 2 min. Samples were fractio- 
nated on an ABI PRISM 3100 at 3 kV for 22 s and at 
15 kV for 45 min. 

Statistical Analysis 

Analysis of morphological variance 

GenStat® (2010, 13* edition, VSN International Ltd, 
Hemel Hempstead, UK) statistical package was used for 
all analyses. ANOVA was used to analyse trait variation 



and the significance of differences between the 5-AzaG 
concentrations was determined using the F-test, and 
least significant difference (LSD) at the 5% level was 
used to compare means [76]. Analysis of residuals indi- 
cated no requirement for transformation of the data. A 
logistic curve modified to start at 100% (survived or nor- 
mal plants) was used to model the dose response data. 
The model was fitted using nonlinear least squares re- 
gression, to estimate the three possible parameters (the 
exponential rate, the dose giving 50% survived or normal 
plants and the lower asymptote) [76]. F-test was used to 
assess whether a lower asymptote was required, and to 
assess lack-of-fit of the model. Restricted Maximum 
Likelihood (REML, [77] Section 5.3.3) analysis was used 
to estimate variation within and between populations of 
treated and control lines of the same generation. Differ- 
ences between populations in seed area were assessed 
using Wald tests ([77] Section 6.2) and differences in 
population variances were assessed using likelihood ratio 
tests ([77], Section 6.3). 

Analysis of epigenetic variance 

Epigenetic similarity between tested samples was deter- 
mined by Principal Goordinate Analysis (PGoA) [78] 
based on the MSAP profiles obtained from primer com- 
binations E2/H1 and E3/H3 using GenAlex (v.6.4). We 
then used Analysis of Molecular Variance (AMOVA) 
[79] to evaluate the structure and degree of epigenetic 
diversity induced by 5-AzaG. Pairwise PhiPT [80] com- 
parisons (an analogue of the F^t fixation index, that mea- 
sures differential connectivity/genetic diversity among 
populations) between samples from a line restricted with 
Hpall or Mspl was used to infer their overall level of di- 
vergence in DNA methylation within the targeted re- 
striction sites (i.e., the lower the PhiPT value between 
samples restricted using Hpall and the same samples 
restricted with Mspl, the lower the level of DNA methy- 
lation in those samples). AMOVA was subsequently cal- 
culated using GenAlex (v.6.4) to test the significance of 
PhiPT between populations [80], with the probability of 
non-differentiation (FST=0) being estimated over 9,999 
permutations. The sum of squares within population 
SSWP [80] was thereby used to infer epigenetic variation 
within lines. 

Cytological localisation of ^"^C by immune-staining 

Four plants from two E2 lines and an S2 water control 
line were grown in controlled environment conditions of 
18-15°G, 16 hour daylength and relative humidity of 65 
-75%. Immature flower buds (1.2mm) were sampled and 
fixed as described by [42] to obtain pachytene stage 
chromosomes from meiotic pollen mother cells. Detec- 
tion of ^^C was performed as described in [47] using 
mouse monoclonal anti-^"^G antibody (Galbiochem, UK). 
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Transcriptome analysis 

Total RNA was extracted using Qiagen RNeasy plant kit 
from leaf 6 of three separate plants of E3 line BraR- 
oAZ_12445e3 (phenotypically indistinguishable from 
control, with small seeds), and two separate plants of R- 
0-18. Labelling and hybridisation to the Affymetrix 
GeneChip Brassica Exon 1.0 ST Array [40] was carried 
out by the NASC Arrays service (www.arabidopsis.info). 
Results were analysed using GeneSpring v. 11.5, with GO 
analysis essentially following that described by [44] . 

Seed of the BraRoAZ_E2 and BraRoAZ_E3 populations 
are available from smita.kurup@rothamsted.ac.uk. Re- 
sequencing genomic data for R-o-18 are available online at 
http://www.brassica.info/datasets/Brassica_resequencing_- 
data/. 
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